Separating Brands from Types: an Investigation of Different Features for the Food Domain

نویسندگان

  • Michael Wiegand
  • Dietrich Klakow
چکیده

We examine the task of separating types from brands in the food domain. Framing the problem as a ranking task, we convert simple textual features extracted from a domain-specific corpus into a ranker without the need of labeled training data. Such method should rank brands (e.g. sprite) higher than types (e.g. lemonade). Apart from that, we also exploit knowledge induced by semisupervised graph-based clustering for two different purposes. On the one hand, we produce an auxiliary categorization of food items according to the Food Guide Pyramid, and assume that a food item is a type when it belongs to a category unlikely to contain brands. On the other hand, we directly model the task of brand detection using seeds provided by the output of the textual ranking features. We also harness Wikipedia articles as an additional knowledge source.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Investigation on Crash Worthiness of Different Vehicle Brands: A Case Study of Rollover Crashes

This study aimed at indexing crash worthiness capability of 20 most frequently used car brands in Iran. Since rollover crashes are one of the most important crash types due to their high impact on crash severity, they were chosen as the case study of the current research. In this regard, the data of 42,118 rollover crashes of urban and rural roads of Iran which occurred from 2009 to 2012 was us...

متن کامل

Investigation of Toxic Metals in the Tobacco of Different Iranian Cigarette Brands and Related Health Issues

Objective(s) The primary objective of this study was to determine whether local and imported cigarette brands used in Iran, have elevated levels of metals or not. The produced data of cigarette brands are compared both with each other and with the existing brands in different countries. Materials and Methods In present study, nineteen various cigarettes brands were randomly purchased from th...

متن کامل

A new technique for bearing fault detection in the time-frequency domain

This paper presents a new Fast Kurtogram Method in the time-frequency domain using novel types of statistical features instead of the kurtosis. For this study, the problem of four classes for Bearing Fault Detection is investigated using various statistical features. This research is conducted in four stages. At first, the stability of each feature for each fault mode is investigated. Then, res...

متن کامل

Quality and Safety Assessment of Bangladeshi Pasteurized Milk

Background: Milk is considered as one of the highly nutritious food for human. This study was undertaken to evaluate the physicochemical as well as the microbial quality of pasteurized milk of different brands available in Chittagong, Bangladesh. Methods: Five types of branded pasteurized liquid milk were collected from retail markets of Chittagong, Bangladesh. Physicochemical analyses were ca...

متن کامل

A note on the problem when FS-domains coincide with RB-domains

In this paper, we introduce the notion of super finitely separating functions which gives a characterization of RB-domains. Then we prove that FS-domains and RB-domains are equivalent in some special cases by the following three claims: a dcpo is an RB-domain if and only if there exists an approximate identity for it consisting of super finitely separating functions; a consistent join-semilatti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014